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QUASISTATIC DYNAMICAL SYSTEMS 


NEIL DOBBS AND MIKKO STENLUND 

Abstract. We introduce the notion of a quasistatic dynamical system, which general¬ 
izes that of an ordinary dynamical system. Quasistatic dynamical systems are inspired 
by the namesake processes in thermodynamics, which are idealized processes where the 
observed system transforms (infinitesimally) slowly due to external influence, tracing 
out a continuous path of thermodynamic equilibria over an (infinitely) long time span. 
Time-evolution of states under a quasistatic dynamical system is entirely deterministic, 
but choosing the initial state randomly renders the process a stochastic one. In the 
prototypical setting where the time-evolution is specified by strongly chaotic maps on 
the circle, we obtain a description of the statistical behaviour as a stochastic diffusion 
process, under surprisingly mild conditions on the initial distribution, by solving a well- 
posed martingale problem. We also consider various admissible ways of centering the 
process, with the curious conclusion that the “obvious” centering suggested by the initial 
distribution sometimes fails to yield the expected diffusion. 
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1. Introduction 

This paper belongs to the wider context of studying statistical properties of dynamical 
systems using probabilistic techniques. We are going to propose a new, but very natural, 
class of non-stationary dynamical systems, and to implement methods from probability 
theory (e.g., coupling) and stochastic analysis (a martingale problem) in their investiga¬ 
tion, alongside methods from ergodic theory and dynamical systems. 

1.1. Motivation and abstract setup. This paper grew out of the will to understand the 
following abstract setup. Consider a system 5 in contact with an ambient system ©. The 
joint system (s, ©) is too large and complex to analyze, while s, the observed subsystem 
of actual interest, is more amenable to scrutiny. The observed system s and the ambient 
system © are allowed to interact, with the following constraints: (i) due to the size of 
the latter, the evolution of 5 has a negligible effect on that of ©, and (ii) s is robust in 
comparison to the strength of the interaction. By (ii) we mean that the characteristics 
of s change very slowly, yet in such a way that the cumulative change over a very long 
time could be enormous. 

At the conceptual level, the problem is to characterize the properties of s over a (long) 
lapse of time, at least in nontrivial special cases. In particular, it is not always reasonable 
to expect that the subsystem s reaches or remains in equilibrium under the perpetual 
influence of ©. Rather, it might slowly pass through small vicinities of equilibria, in a 
continuous manner. Given an absence of equilibrium states, the question arises as to how 
to describe statistical properties of such a system. 
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In order to model the above setup, we propose the notion of a quasistatic dynamical 
system in Section 1.2. In Section 1.3 we introduce a means of describing properties of 
quasistatic dynamical systems via stochastic diffusion processes. We follow this with a 
presentation of continuous-time quasistatic dynamical systems and subsequently a discus¬ 
sion of relevant literature. In Section 2 we present a specihc class of quasistatic dynamical 
systems, whose properties we investigate in the remainder of the paper. Section 3 contains 
the statements of theorems, with an overview of the proofs in Section 4. 

1.2. (Discrete-time) Quasistatic dynamical systems. Throughout the paper, we use 
the standard notation 

{u} = u (mod 1 ) and [uj = u — {u} 
for the fractional and integer part of a real number u > 0, respectively. 

Definition 1.1. Let X be a set and M a collection of self-maps T : X ^ X equipped 
with a topology. Consider a triangular array 

T = {T„ gAI : 0 < k < n, n > 1} 

of elements of M.. If there exists a piecewise-continuous curve 7 : [0,1] ^ A1 such that 

lim Tn,\nt\ = It , t e [ 0 , 1 ] , 

n^oo 

we say that (T, 7 ) is a quasistatic dynamical system (QDS). 

Definition 1.2. We call X the state space [also phase space) and Xi the system space 
of the QDS. 

Obviously a QDS is a generalization of an ordinary dynamical system: a discrete-time 
dynamical system is determined by a single map T : X ^ X, which is the degenerate QDS 
obtained by setting T„ ^ = T for all k and n. (Then 'jt = T for all t.) Below we will also 
provide the dehnition of a continuous-time QDS which similarly generalizes the notion of 
an ordinary continuous-time dynamical system (semiflow). One can readily come up with 
further generalizations of the dehnitions given, but spelling them out here seems to add 
little of substance for the purposes of this paper. 

Let us justify the dehnition above. An element of the state space X represents the state 
of the system, whereas the triangular array T describes the dynamics: given the initial 
state X ^ X and integers 1 < k < n, the image 

^n,k O • • • O G X 

is the state of the system after k steps on the nth level of the array T. Passing to the 
continuous time parameter t G [0,1] via the scaling k = \nt\ , we note that the piecewise- 
constant curve 1 1 —)■ approximates 7 in the system space A4, and this approximation 

converges (pointwise) as n —)■ 00 . The system 71 differs signihcantly from the 

system 1 70 in typical situations. 

The rate of convergence of T„q„q to 7 ^ will affect the properties of the QDS and may 
need to be specihed when studying a particular system. Limiting properties will certainly 
depend on the limit curve 7 and may or may not depend on the array T, see Section 3. 

One cannot directly study a “limit system”, for letting hrst n ^ 00 one would end up 
just iterating the map 70 and none other. Rather one must study (for example, statistical) 
properties of the QDS at level n and see how those properties behave in the limit. The 
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main goal of this paper is to initiate a systematic study of such systems that would 
eventually result in a comprehensive mathematical theory of QDSs. 

Drawing a parallel with the first paragraph of this section, the limit curve 7 models 
the evolution of the observed system s, as the ambient system & forces s to transform, 
possibly signihcantly, over time. The jumps in 7 allow for singular, abrupt events. For 
example, such events could include jumps from one connected component of the system 
space Ai to another. 

We have borrowed the apt term “quasistatic” from thermodynamics. There it refers to 
idealized processes in which the observed system transforms infinitesimally slowly due to 
external influence. Such a system is in thermodynamic equilibrium at any given time, 
yet traces out a continuous path of different equilibria over an infinitely long time span. 
For instance, all reversible thermodynamic processes are quasistatic. See, e.g., [11,25] for 
more background. The connection with thermodynamics should not be construed as a 
limitation of scope; the definition of QDS is, of course, purely abstract and could be used 
widely according to one’s needs. 

A natural class of QDS is provided by the following example: 

Example 1.3 (Quasistatic billiards). Dispersing billiards on a torus consists of strictly 
convex scatterers with smooth boundaries embedded in the (surface of the) torus. A particle 
moves on the torus, in the exterior of the scatterers, and experiences an elastic collision 
when it meets a scatterer. Assuming that the length of free flight between any two suc¬ 
cessive collisions is uniformly bounded, the location of the particle can be kept track of 
by keeping track of the collisions. For a fixed scatterer configuration K, this leads to a 
representation of the dynamics by a billiard map mapping one collision to the next. 
A model of dispersing billiards with moving scatterers was introduced in [33]: Let (iFfc)^o 
be a seguence of scatterer configurations, such that d{Kk-i, Kk) < s holds uniformly for 
some small £ > 0. Here d is a natural distance on the space fC of admissible configura¬ 
tions. Then FKk,Kk-i represents the dynamics between the {k — l)th and kth collisions, 
during which the configuration has changed by a distance < e, from K^-i to Kk, and the 
compositions FK,,,Kk-i o • • • o Fx^^Kq represent the dynamics for scatterers moving with 
speed < e. This yields a guasistatic dynamical system in the limit e —)■ 0 of infinitesimally 
slowly moving scatterers. More precisely, consider a triangular array of configurations 
{Kn,k ^ A : 0 < k < n, n > 1} with d{Kn,k-i, K^^k) < £n, where hm„^oo = 0. Assume 
moreover that there is a continuous curve 7 ^ : [0,1] ^ /C such that lim^^oo = 7 ^. 

Setting T^^k = kHn k-i 1 < k < n and n > 1 results in a QDS, which we coin 
guasistatic billiards. (The system space AA can be identified with K, x )C, in which case 
the limit curve 7 takes values in the diagonal subspace: 7 * = 


1.3. Statistical description of a QDS. In order to gather information about the state 
Xn,k of the quasistatic dynamical system, one performs a measurement of an observable 
quantity, whose value is determined by the state. Accordingly, the quantity is represented 
by a function / : X —)■ R, called an “observable”, and the outcome of the measurement by 
f{xn,k)- We are going to study the sequence of measurements 

f{Xnfl),f{Xn,l), • • ■J{Xn,n) 

on the nth level of the QDS, corresponding to the initial state Xnp = x, as n ^ 00 . For 
practical reasons, such as sensitive dependence on the initial state (aka chaos), it is natural 
to assume that a; is a random variable with values in X and some distribution p. This 
renders {xn,k)k=o ^ sequence of random variables. By definition, each is completely 
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determined by x, and its distribution is the pushforward^ fin,k = (^n,fc o • • • o 
which will be different for different n and k. In other words, the random variables Xn,k 
are neither independent nor identically distributed. 


The random paths 


t H- Sn{x,t) 



f{.Xn,ls\) ds 


are piecewise-linear interpolations of Birkhoff sums: if nt G M, Sn{x,t) = 

Understanding the distribution of these paths for a large class of observables in the 
limit n ^ oo is a natural goal and can be considered a good statistical description of 
a QDS. It is reasonable to hope that the random paths, once properly centered and 
scaled, converge to a stochastic diffusion process. We shall prove that this indeed hap¬ 
pens for a paradigmatic QDS whose system space consists of strongly chaotic expanding 
circle maps, formally presented in Section 2. The result holds for rather generic / and /r; 
we only ask that / be Lipschitz continuous and /i absolutely continuous. The curve 7 
need not be especially regular; Holder continuity suffices. The limit process will depend 
on 7 and /, but not on p. See Section 3 for the precise statements of the main results. 


To get an idea why passing to the limit n —)■ cxo might yield a diffusion process for the 
paths, consider the following simple example: 


Example 1.4 (A degenerate case). Let T = Tn,k '■ x ^ 2x (mod 1) for all n > 1 and 
< k < n, so the curve 7 is constant, 'yt = T for all t. Define the observable f : [0,1) —?■ M 
by f{x) = —1 for x G [0, |) and f{x) = 1 otherwise. Taking Lebesgue measure for the 
initial distribution fi, the Birkhoff sums model a fair coin toss, or the simple symmetric 
random walk. The paths n~^Sn{x, ■) converge in distribution to the zero path f 1 —)■ 0. The 
random paths n~ 2 Sn{x, •) converge in distribution to standard Brownian motion. 


1.4. Continuous-time QDS. For completeness, we finish this section with a discussion 
on continuous-time dynamical systems. In continuous time, an ordinary dynamical system 
is determined by a semiffow, i.e., a one-parameter family of maps 0^ : X —)■ W, s G M+ = 
[0, cxo), satisfying the semigroup property ocff = 0 ^+^ with 0° = id^. Typical semiffows 
are those generated by a vector field: if the differential equation 

^ = U(|/) with 1/(0) = X 


specified by the vector field U on X has a unique solution y = y{x, s) for all x ^ X, 
then 0 ^( 0 :) = y{x,s) defines a semiffow. Note that ■^(j)^{x)\s=o = V{x). Semiffows can 
also depend on time explicitly, as is the case with a time-dependent vector field: if the 
differential equation 


ds 


= V{y,s) with y{r) = X 


specified by the time-dependent vector field U( •, s) on X has a unique solution y = 
y{x,r,s) for all x G X and all r > 0, then (p^’^{x) = y{x,r,s) defines a time-dependent 
semiffow; see below. Note that ^(p^’^(x)|<j=o = V{x,r). Time-dependent semiffows on X 
become (ordinary) semiffows on X x M_|_ simply by keeping track of the time-coordinate 
explicitly. More precisely, let pi and p 2 be the canonical projections from X x M+ to X 
and R+, respectively. Then a semiffow 0^ : X x M+ —)■ X x M+, s G M+, having the 


^Assuming the map T is measurable, the pushforward of a measure y, is defined by Dfj,{E) = y(T ^E) 
for all measurable sets E. 
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property p 2 ( 0 ^(x,r)) = r + s defines the two-parameter family of maps : X —)■ X : 
= pi( 0 ®(x,r)), (r, s) G having the characteristic property 

^r,s+u ^ ^r+s,u ^ = id^ ( 1 ) 

of time-dependent semifiows. Note that (1) is consistent with the following interpretation; 
given the state x of the system at time r, is the state s time nnits later. Conversely, 

given a time-dependent semifiow on X, 

0 ^(x, r) = r -f s) ( 2 ) 

determines a semiflow on X x M+. 

Definition 1.5. Let X be a differentiable manifold and M a eollection of vector fields 
on X equipped with a topology. Let n be a parameter taking values either in Z_|_ or R_(.. 
Consider the array 

V = {Vn{ ■ ,r) e M : 0 < r < n, n > 0} 

and assume that, for each n > 0 , the time-dependent vector field Vn{- ,r) determines a 
time-dependent semifiow , 0<r<r-|-s<n. If there exists a piecewise-continuous 

curve 7 : [0,1] —A1 such that 

lim Vn{ ■ ,nt) = 'yt , t e [ 0 , 1 ] , 

n^oo 

we say that (V, 7 ) is a continnons-time QDS. 


Definition 1.6. The nomenclature of Definition 1.2 continues to apply in the continuous¬ 
time context. We also say that the array $ = {(p'ffi : 0<r<r-f-s<n, n>0}zsa 
qnasistatic semifiow. 


The convergence condition in the dehnition means that in order to observe a change of 
a hxed order of magnitnde in the vector held Vn at the nth level of the array, one has to 
wait a time of order n. In the limit n ^ oo the vector held changes inhnitesimally slowly, 
yet traces ont the cnrve 7 from beginning to end. For illustrative purposes only, here is a 
very simple example of a qnasistatic semihow: 


Example 1.7. Let X be the unit circle in the complex plane, and let ojq,oji be two real 
numbers. Then (Pn^{x) = xexp{iuoS -|- ^(ooi — 0 i!o)s(s 2r)n~^) defines a time-dependent 
semifiow on X; see (1). In fact, the vector field 


Vn{x,r) = =i(uJo-\- (wi - Uo)rn ^)x 

ds s=o 

describes rotation at angular speed cjq + (<^i — Uo)rn~^. We have 


lim Vn{x, nt) = Hujq + (cui — uJo)t)x = 7t(x) , f G [0,1] . 

n^oo 


Thus, this particular quasistatic semifiow describes a system whose angular speed on the 
circle changes infinitesimally slowly from Uq to Ui. 


Again, one can readily come up with generalizations of the above dehnition (e.g. re¬ 
garding the linear time-scaling r = nt), but we do not record them here. Let us, however, 
mention one class of models not quite falling under the dehnition provided. To compare 
with the earlier discrete-time case, set 

o • • • o T„7^j+i , 0 <r <r + s <n . 
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Then (1) holds. Note that the convergence condition in Dehnition 1.1 now becomes 

,^nt—lA _ rji 

which in this context makes more sense than the condition in Dehnition 1.5. 

Just as in the discrete-time case, it is interesting to study a quasistatic semihow in 
terms of the limit behaviour of the random paths 

pnt 

t^Sn{x,t)= / f{Xn,s)ds , 

Jo 

where 

Xn,s = V>n'{x) , 

as n —)■ cxo. A good statistical description of the QDS at issue entails proving limit laws 
for a large class of observables / and initial measures p. 

1.5. Preceding literature. This work is philosophically a natural successor of the se¬ 
quence of papers [19,28,31,33] on the analysis of time-dependent dynamical systems, 
although in scope it differs drastically. In [19], “asymptotic similarity” of pushforward 
densities under sequences of (piecewise) expanding maps was established using trans¬ 
fer (Perron-Frobenius) operator techniques. Asymptotic similarity is an indication of 
statistical memory loss: the system quickly forgets the distribution of its initial state. 
In [28,31,33], a different approach for establishing memory loss for progressively more com¬ 
plicated time-dependent systems was developed based on the idea of coupling. See [5-7,35] 
for implementations of coupling in deterministic dynamics, and [22] for an introduction 
to the probability theory of coupling. Let us also mention [14,26], which use yet another, 
Hilbert projective metric, technique for a time-dependent system, as well as [1]. The 
source of memory loss in the systems of the preceding papers is sensitive dependence on 
initial conditions (aka chaos). In the other extreme, memory loss is also produced by sinks; 
the references closest to the time-dependent setup we have been able to hnd concern ran¬ 
dom sinks [3,20]. In addition to memory-loss issues there are works including [15,17,18,36] 
on the entropy of time-dependent dynamical systems. 

Besides the ones mentioned, few limit laws have been proven on the properties of non- 
random time-dependent dynamical systems. (On the contrary, a vast literature — which 
we cannot even begin to cover here — exists on random dynamical systems, concerning 
both averaged/annealed and quenched limit theorems.) In special cases, some central 
limit theorems have been obtained [2,8,27]. Indeed, one of the difficulties one faces in 
this setting is that it is often not even clear how possible results should be formulated, 
let alone proven. 

In our proof, a key ingredient is rapid memory loss, which we establish via coupling. 
In order to identify the limit process, we solve a well-posed martigale problem [12,29,34]. 
(See Section 4 for an outline of the proof). In the context of dynamical systems, the idea 
of resolving to a martingale problem has been used (sparingly) in the theory of averaging; 
hrst, to our knowledge, in [10], and then in [9], which is closest to our work. Averaging is 
a tool in the analysis of so-called slow-fast systems concerning the limit where one of two 
variables evolves inhnitely fast compared to the other: in order to describe the evolution 
of the slow variable, one may be able to “average out” the influence of the fast one and 
thus reduce the problem to an effective one in which the slow variable alone appears. 
We stress that the abstract setup in Section 1.1 is situated rather at the other end of the 
spectrum; it involves studying the fast variable Xn,kj which (in the limit n —)■ cxo) describes 
the state of the observed system s under the influence of the large, slow, system &. Let us 
mention, however, that averaging ideas could be used to study the statistical properties 
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of QDSs in certain situations, regarding the normalized Birkhoff sum n ^Sn{x,t) (along 
with an index) as a slow variable and x as a fast variable. 

1.6. How the paper is organized. The model QDS, considered in the rest of the paper, 
is presented in Section 2. In Section 3 we state our two main results. Theorems 3.1 and 3.6, 
keeping the technical prerequisites down to a minimum. In Section 4 we attempt to outline 
the proofs in a non-technical manner. In Section 5 we explain frequently used notation and 
record key definitions. We then introduce in Sections 6 and 7 the preliminaries necessary 
for understanding the proofs of Theorems 3.1 and 3.6, which are presented in Sections 8 
and 9, respectively. In Section 3.3 we also state generalizations of the main results, whose 
proofs are outlined in Section 10; the proofs involve minor modifications of the preceding 
sections, so we only indicate the necessary changes, with the hope that such a choice 
makes the paper easier to read. 


2. The model 

In this section we introduce the quasistatic dynamical system to be studied in the rest 
of the paper. Fix A > 1 and A* > 0 once and for all. Let A4 denote the set of 
expanding maps T : S ^ § on the circle with the following bounds: 

infr'>A and ||T"||oo < Al* , TeM. 

The space A4 is endowed with the metric dpi defined by 

dci{Ti,T 2 ) = supd(Tix,T 2 X) + \\T[ - T^Woo 
xes 

for Ti, T 2 G Ai. Here d is the natural metric on S = M/Z. 

We construct a QDS with state space S and system space M as follows. First, fix a 
Holder continuous curve 7 : [0,1] —)■ Al with exponent rj G (0,1). Let T be a triangular 
array of maps 

T = {Tn^k gA 4 ; 0<fc<n, n>l} 

for which 

supn^ sup dci(T„,[nq,7t) < 00 . (3) 

n>l 0 <t<l 

Clearly (T, 7 ) meets the requirements of Definition 1.1 of a QDS. A prototypical example 
to keep in mind is where ^ = 7 fcn-i, though the maps Tn^k are not required to live 
on the curve 7 . The convergence rate (3) is chosen to reflect the smoothness of the 
curve; in particular we have similar bounds on 7 fcn-i) and on dci( 7 fcn-i, 7 (fc+i)n-i), 

resulting in the bound 

supn’^ sup dci(Tn^k, Tn,k+i) < 00 . (4) 

n>l 0<k<n 

Stronger convergence would not be a natural assumption. 

For a hxed n, the maps Tn,k approximate the curve 7 , traversing from beginning to end 
as k increases from 1 to n. Moreover, tends to the well-defined limit 7 ^ as n ^ 00 . 

For future use, we point out that every T G A4 has a unique invariant probability 
measure fiT equivalent to the Lebesgue measure m on S. The measure fix we sometimes 
call an SRB measure, for Sinai-Ruelle-Bowen. For brevity, we write 

Rt R'ft and fin,k ■ 
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3. Results 


We begin our study of the quasistatic dynamical system (T, 7 ) introduced in Section 2. 
The goal is to understand statistical properties of the QDS. So let / : S —)■ M be an 
observable, and denote 


fn,k = / o Tn,k o • • • o , 0 < /c < n . (5) 

(Our convention is that fn^ = /.) This yields a triangular array of random variables once 
an initial distribution /i on § is given. We dehne the functions S'„ : S x [0,1] —)■ M by 

pnt [ntj-l 

Sn{x,t) = / fn,lsi{x)ds = ^ fn,k{^) + (x) , U > 1 . 


3.1. The mean. A natural quantity to study first is the mean 

Cn{x,t) =n~^Snix,t) . (6) 

Given an initial probability measure on S, each (n is a random element of 

whose distribution we denote P((. As is customary, we also denote = f fdjut, etc. 

Theorem 3 . 1 . Suppose f is Lipschitz continuous and p is absolutely continuous. The 
function t i—)■ is continuous. The measures converge weakly, as n ^ oo, to the 

point mass af C ^ ^*^([ 0 , 1 ],M), where 

C{t) = [ Ts{f)ds . (7) 

Jo 

Remark 3 . 2 . Note that the limit distribution in Theorem 3.1 is independent of fi. 

In other words, given an arbitrary initial measure /i having a density, the stochastic 
process Cn converges, as n —)■ oo, to the non-random limit (. 

3.2. Fluctuations about the mean. The mean gives a coarse description of a limit 
statistical property. Deeper insight is obtained by studying the fluctuations at a hner scale, 
by looking into the statistical properties of n^C,^ instead of (n- For this to make sense, a 
centering is needed. To this end, we choose a sequence of functions : S x [0,1] —?■ M, 
and dehne the functions : § x [ 0 , 1 ] —?■ M : 

Xnix,t) = n^Cnix,t) - n^Cn{x,t) (8) 

quantifying the huctuations of n^Cn about n^Cn- The goal is to describe the statistics of 
these huctuations, as n —)■ cxo, by a probabilistic limit law. 

The choice of a good centering sequence (c„)„>i turns out to be a delicate issue. There 
are two canonical choices, both independent of x, namely c„(t) = (where /i 

is the initial measure) and Cn(t) = m(Cn( • ,t)) (where m is the Lebesgue measure). The 
hrst one of these amounts to centering by force: yi{Xn{' yt)) = 0 holds for all t G [0,1]. It 
is perhaps surprising that this “natural” choice does not appear to yield good statistics 
for Xn unless the initial measure /x has a sufficiently regular density. The second choice, 
centering using the Lebesgue measure instead, works better; in that case we obtain a 
universal limit law for Xn as long as the initial measure p is just absolutely continuous. 
We now introduce the notion of an admissible centering sequence, for which our main 
result holds: 
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Definition 3.3. We say that a centering sequence {cn)n>i of functions Cn : S x [0,1] —?■ M 
is admissible with respect to an initial probability measure p, if (i) Cn{x, ■) is continuous 
for almost every x w.r.t. p and (ii) t i-A- n 2 Cn{x,t) — n 2 m.(Cn(‘ ,t)) viewed as a random 
element of C^{[0, 1],M) converges to zero in probability w.r.t. fi, i.e., 


lim /i 

n—)-oo 


X G 


sup |n 2 c„(x, t) 
te[o,i] 


n2m(Cn(-,^)) 



0 


for any <5 > 0. For brevity, we also say in this case that Cn is admissible (w.r.t. p). 


Remark 3.4. If the centering Cn is independent of x, there is no need to refer to an initial 
measure: Cn is admissible either with respect to every measure or no measure at all; it is 
admissible if and only if 

lim sup |n 2 Cn(t) — u 2 m(Cn( •, t)) I = 0 . (9) 

Clearly Cn{t) = m(Cn( •, t)) is admissible. Since m(Cn( •, t)) converges to ((t), so does Cnit) 
for any admissible Cn. Whether ,t)) converges to n^C,{t) (i.e., whether ({t) is 

admissible) seems to depend on the regularity of the curve 7 , see Lemma 3.5(ii). On 
the other hand, given a measure v (perhaps different from the initial measure p.), Cn{t) = 
^(Cn( •, t)) fnay or may not be admissible, depending on the regularity of (the density of) v. 


The next lemma establishes admissible centering sequences of the preceding kind under 
different conditions, and sheds light on the role of the function ( defined in (7). 

Lemma 3.5. (i) If p G (0,1) is arbitrary and v is an arbitrary probability measure having 
a Lipschitz continuous density, then the centering Cn{t) = o{(n{ ■ ,t)) *■5 admissible. 

(ii) If p > then the explicit centering 

Cn{t) = C{t) 

is admissible. Here ( is the function defined in (7). 

(Hi) If p ^ (0,1) is arbitrary and v is an arbitrary absolutely continuous measure, 
then n{(n{ ■ ,t)) tends to C{t), as n ^ 00 . The convergence is uniform in t. 


Lemma 3.5 is proven in Section 9.1. 


We are now ready to proceed to the main result of the paper. Given an initial probability 
measure p on S and an admissible centering sequence (c„)„>i, each Xn is a random element 
of G°([0,1],M), whose distribution we denote P^. We also denote 

ft = f - Ptif) 


and 




lim pt 

m—^oo 



( 10 ) 


which is just the limit of the variance of f ° "It with respect to the measure pt. 


Theorem 3.6. Suppose f is Lipschitz continuous, p is absolutely continuous, and {cn)n>o 
is admissible. The function t 1 —)■ ^({f) is continuous. The measures Pjj converge weakly, 
as n —)■ 00 , to the law of the process 

X{t) = [ as{f)dWs . 

Jo 
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Here W is a standard Brownian motion, and the stochastic integral is to he understood in 
the sense of ltd. 


Let us pause to discuss Theorem 3.6 in conjunction with Lemma 3.5. Here u is an 
arbitrary measure having a Lipschitz continuous density. For every value of the regu¬ 
larity exponent rj G ( 0 , 1 ) of the curve 7 , the implicit centering Cn{t) = i^(Cn(-,t)) is 
a valid choice; in particular, c„(f) = m(Cn(',t)) can always be used, and if the initial 
measure p has a Lipschitz continuous density (some hypothesis on the density is nec¬ 
essary, see Example 3.7), also the choice c„(f) = p^iCni',t)) is admissible. The special 
case ?7 < I is interesting from the point of view that 7 is allowed to have the degree of 
regularity of Brownian paths. (With probability one, the path of a Brownian motion is 
Holder continuous with any exponent < i.) Thus, we may think in this case that the 
driving ambient system forces the observed system 7 ^ to perform a Brownian motion in 
the space Ai. On the other hand, the case 7 > | of a more regular curve 7 allows also 
for the explicit centering Cn{t) = ({t) to be used. The reason for this dichotomy is that 
the order of - ,t)) — n^C{t) appears to be so large as to contribute to the limit 

process, if the regularity exponent rj of the curve 7 is | or less. A similar remark concerns 

the order of n^p,{(ni • ,^)) ~ ''^^^(Cn( • ,t)), if the density of p is not sufficiently regular. 
Let us illustrate the last point with an example: 


Example 3.7 (An inadmissible centering). Let us return to the setting of Example l.f, 
so T = Tn^k ■ X 2x (mod 1) for all n, k, and f{x) = —1 for x G [0, and f{x) = -|-1 
otherwise. Then m(Cn( •, ^)) = 0 for all n,t. Taking as our centering the zero function, the 
random paths Xn{-) converge in distribution to standard Brownian motion, if Lebesgue 
measure is the initial distribution. Disregarding the (here inconseguential) non-Lipschitz 
nature of f, Theorem 3.6 states^ that for any absolutely continuous initial distribution, 
the random paths converge in distribution to the same process, standard Brownian motion. 
This need not hold if one changes the centering: 


For j > 1, let Aj denote the interval [0,2“^^) define the probability measure Uj = 
2‘^AAjVn. For k < 2^ — 1, T^{Aj) C [0, |) so Uj{f o T^) = —1, while for k > 2^, the 
pushforward Tfuj = m so iyj{f o T^) = m(/) = 0. Therefore ^'j(Cn( •, t)) <0 for all j, n, t. 
Let K E N be large enough that e = Ylj>K T absolutely continuous 

probability measure 

00 ^ 

p={l-e)m + J2 ■ 

j=K-J 

Since iZj{Cn{ ■ ,t)) <0 for each j, for each j we have 

h(Cn(- ,t)) < ^«^i(Cn(- ,t)) • 


Lett G (0,1] and taken big enough thatnt > 2^. Denote jn = [log 2 [nfJJ, so 2^ < \nt\ < 
Jt follows that K < jn < \ 0 g 2 nt. Now ^'j^+i((C„( •,f)) = Jq"*(— 1) ds = —t. 

Therefore, 


n^TiCni-A)) < 


{jn + 1 ) 


>^jn+l{Cn{ ■ ) ^)) 


1 1 
tn^ tn^ 

{jn + ^y~ {logout+ iy' 


^Theorem 3.6 can be extended to the setup of the example. Alternatively, a Lipschitz approximation 
of / can be used to obtain similar results; we skip the straightforward details. 
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which tends to —oo as n ^ oo. In particular, if one sets Cn{t) = /i(Cn( • ,^)); then n^Cn 
converges pointwise (and uniformly away from 0) to —oo, while n2m(Cn( •, ^)) = 0- 
Hence, {cn)n>i does not form an admissible centering sequence; see (9). 


Remark 3.8. (i) Note that the limit distribution in Theorem 3.6 is independent of fi. 
(a) The process x *■5 o martingale starting at 0, and its quadratic variation is 

[x]t = f 0 - 2 (/)ds . 

Jo 

It follows [29] that there exists such a standard Brownian motion W that 

X{t) = • 

In other words, x a Brownian motion up to the non-random time change 1 1 —)■ [x\t- 

(Hi) Regarding the possible degeneracy of the process x, o result due to Leonov [21] shows 
that, given s G [ 0 , 1 ], d'‘[{f) = 0 if and only if there exists an If function g : S ^ M. such 
that the cohomology equation fg = g°Xs~g holds almost everywhere, and Livschitz (Livsic) 
rigidity theory [23, 24] shows that g has a Holder continuous representative (with the 
same exponent as f) for which the cohomology equation holds everywhere. Equivalently, 
Elllfs o'j^[x) = 0 for any point x of any period p with respect to Xs- Since periodic points 
are dense, 6']{f) = 0 amounts to a severe restriction on the choice of f. Because periodic 
orbits are stable under perturbations of the map, it moreover follows that nondegeneracy 
is an open condition with respect to the map: if ^^{f) ^ 0 , then cr[{f) ^ 0 for all t 
sufficiently close to s. (Actually 6[[f) depends continuously on t; see Lemma 7.1.) 


3.3. Generalizations. Here we discuss two ways to generalize Theorems 3.1 and 3.6. 
First we consider vector-valued observables /:§—?■ d > 1. After that we consider 
curves 7 in Ad with a finite number of discontinuities (jumps). 

In the case of a vector-valued observable /, we define the vector-valued processes (n-, C 
and Xn according to the formulas ( 6 ), (7) and ( 8 ), respectively. The centering c„ is also 
vector-valued, but Definition 3.3 remains otherwise intact. For clarity, we denote the 
vector components by superindices enclosed in parentheses: / = (/d),..., etc. 


Theorem 3.9. Suppose /:§—?■ W^, d > 1, is Lipschitz continuous. Then Theorems 3.1 
and 3.6 as well as Lemma 3.5 continue to hold with the modifications that W is a d- 
dimensional standard Brownian motion and at{f) is the dxd matrix defined as the square 
root of the covariance matrix 


dt (/) = lim Tt 

m^oo 




Here {v v)ij = nh)y(i) y = ^ ^ 


Note that and qj-q exactly the processes appearing in ( 6 ) and ( 8 ) for the scalar¬ 
valued observable /*-*), 1 < i < d. The generalization here is that Theorem 3.9 describes 
the joint behaviour of each of these two sets of d scalar-valued processes. 


The next result allows for jump discontinuities: 
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Theorem 3.10. Let be a finite partition of [0,1] into intervals. Suppose 

the curve 7 : [0,1] —?■ A1 restricted to each fi, 1 < i < m, is Holder continuous with 
exponent rj G (0,1), having possibly jumps where two intervals meet. Then Theorems 3.1 
and 3.6 as well as Lemma 3.5 continue to hold, as does Theorem 3.9. 


4. Overview of proofs 

We start off with a probability measure jj, on the circle. This allows us to regard x as a 
random variable with distribution fi, and then Xn,k = Tn,k°'' '°Tn,i{x) as another random 
variable with distribution fin,k, the pushforward of jj,. These random variables are neither 
identically nor independently distributed. 

While not identical, we show in Lemma 6.10 that the distributions of Xn,k and Xn,k+i 
(the densities of the corresponding pushforwards) are similar to each other and to the 
corresponding SRB distribution. 

Compensating for lack of independence, one can still obtain strong decay of correlations. 
This is shown in Corollary 6.7 and Lemma 6.11, following Lemma 6.6 which is obtained 
using a coupling argument. 

In order to get uniform bounds in the aforementioned results, we must assume some¬ 
thing about the regularity of the initial density. Being Lipschitz continuous is enough; 
often we also assume that the initial density is strictly positive as this simplihes the 
arguments. 

In Section 7, we study the processes Xn under two assumptions. First, the initial 
measure jx should be strictly positive and Lipschitz continuous. Second, the centering 
should be of the form c„(t) = fi{Cn{ • ,t)) [using other measures with regular densities in 
the centering would work, but choosing pi leads to a slightly easier proof]. To emphasize 
the particular nature of this process, we denote it by instead of Xn- We control the 
second moment p^[[^n{t + h) — 'Cn(^)]^] by h times the variance at t plus an error term, in 
Lemma 7.2. Using regularity and decorrelation of the pushforward densities, we obtain 
decorrelation estimates for the processes Cr^in in Lemma 7.3. Early in the section, the 
variance (/) is shown to depend continuously on t. 

Rather than proving Theorem 3.1 directly, it is simpler to hrst prove the following 
proposition which assumes a regular initial density, recalling denotes the distribution 
ofCniu C' 0 ([ 0 , 1 ],M). 

Proposition 4.1. Suppose f is Lipschitz continuous and p. is absolutely continuous with 
a strictly positive Lipschitz continuous density. The measures P([ converge weakly, as 
n —)■ 00 , to the point mass at ( E C°([ 0 , 1 ],M), where 

C{t)= [ Ts{f)ds . 

Jo 

This proposition is proven in Section 8.1, via a Dynkin formula. Noting that continuity 
of t eA pt{f) follows immediately from (25), the assertions of Theorem 3.1 now follow if 
one can drop the regularity condition on the absolutely continuous measure. This is done 
in Section 8.2 via a portmanteau argument. 

We prove Theorem 3.6 in Section 9. Again, we hrst prove the following proposition 
assuming a more regular initial density and special centering, and then use a portmanteau 
argument together with Lemma 3.5 to pass to the full theorem. Lemma 3.5 is proven at 
the start of Section 9. 
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Proposition 4.2. Suppose f is Lipschitz continuous, p, is absolutely continuous with a 
strictly positive Lipschitz continuous density, and Cn{t) = p{(n{t))- The measures 
(distribution of ^n) converge weakly, as n ^ oo, to the law of the process 

X{t) = [ as{f)dWs . 

Jo 

Here W is a standard Brownian motion, and the stochastic integral is to be understood in 
the sense of ltd. 

The proof of the proposition is presented in Section 9.2. Tightness of the measnres is 
shown first, allowing one to extract convergent snbsequences. Again a Dynkin formula 
for the limit measure P is proven. A rather lengthy argument then implies that P solves 
the martingale problem corresponding to the desired expression for the diffusion x- By 
uniqueness of such solutions, the statement of the proposition holds. 

The generalizations are proven in Section 10. 


5. Notation and definitions 


We use Lip to denote the space of Lipschitz continuous maps (from S to M), Lip(h.) to 
denote the Lipschitz constant of a function h and ||h||Lip = Lip(h) + ||h||oo to denote its 
Lipschitz norm. 


The constant function 1 ; S —?■ M takes the value 1 identically, while if / C §, the 
characteristic function 1/ takes the value 1 on J and 0 on S \ J. 


Given T E Ai, the transfer operator Ct ■ is defined by 


Crh^x) 


E 

y&T i{x} 


h(y) 
T'M ' 


It describes the evolution of probability densities under the map T; if x is distributed 
according to probability density p, then T{x) is distributed according to the probability 
density CtP- We will often write 


Tt -^74 and ^ j, ■ 

If Ci is the transfer operator of Ti, then the transfer operator of T = o • • • o Ti satisfies 
Cf = Ck ■■■ Cl- We shall sometimes write Tz for T{z). 

In the following section, C will stand for the transfer operator of a general map in Ad 
and will be a general sequence of such operators. The constants appearing will 

not depend on the choice of these operators. We shall routinely use the facts that C maps 
probability densities to probability densities and that 

\C{uv)\ < C\uv\ < ||M||ooE|n| . 


Recall that pt is the SRB measure (equivalent to Lebesgue measure m) associated 
to Xt, and that p^.k = measure associated to T^^k- Denote by pt and pn,k the 

corresponding densities, and by pt the density of pt- 


Remark 5.1. Since Clfl converges in the supremum norm to p^ as k ^ oo, it will follow 
from (15) that Pt G Lip with 

sup llprllLip < oo . 

T&M 


(11) 
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Let Hn^k denote the pushforward 

hrijA: (Ln,fc O • • • O , 

where /i is a probability measure with density p. Then the density of pn,k is 

Pn,k ^n,k ' ' ' ^n,lP ■ 

Recall that fn^k = / ° Tn^k o ■ ■ ■ o T„^i; see (5). Given an initial measure p, we will often 
encounter the centered versions of such functions, dehned as 

fn,k = fn,k “ p{fn,k) ■ ( 12 ) 

Our main results are formulated in terms of Lipschitz continuous densities on §. How¬ 
ever, for technical reasons we introduce the following notions of regularity which will be 
convenient for the proofs. 

Definition 5.2. Given a point z G S, we define the arc Jz = S\{ 2 ;}. We denote by \x — y\z 
the length of the subarc of Jz with endpoints x, y. We say that a function —)■ M zs 

Lipschitz continuous on Jz, with constant L > 0, if 

\9{x) - 9{y)\ < L\x - y\z , x,yeJz. 

In other words, g is Lipschitz continuous except across ^ G S when the distance of x and y 
is understood as the length of the arc between the two points not containing z. 

Definition 5.3. Given L > 0, let Vl be the class of all probability densities : S —)■ M 
with the properties that (i) fj > and (ii) there exists 2 ; G S such that logfj is Lipschitz 
continuous on Jz = S \ {z} with constant L. 

For the most part we will work with the class and prove the main theorems in the 
setting of such initial densities hrst. Only then will the theorems be generalized to include 
Lipschitz continuous initial densities which are allowed to take the value zero. 

Remark 5.4. (i) A function : S —)■ M that is Lipschitz continuous with constant L 
relative to the standard metric d of S is, in the above sense, Lipschitz continuous on Jz 
with the same constant L, for any z G §. This follows from d{x, y) < \x — y\z- 

(ii) Note that if if & Vl, then e~^ < if < e^ and that if itself (without the logarithm) 
is Lipschitz continuous on Jz with constant Le^. These follow from the fact that the 
probability density if must take the value 1 somewhere on Jz- 

(Hi) In the opposite direction, if a probabililty density if satisfies if > c > 0 and is Lipschitz 
continuous on Jz with constant L, then if G T>c-il- Indeed, 

\\ogif{x) -\ogif{y)\ < c~^\if{x) -ifiy)\ < c~^L\x-y\z , x,y e Jz - 

The following lemma states that Ul^qT>l is dense in the set of all probability densities 
with respect to the L^ norm. 

Lemma 5.5. Let : S —)■ M &e an arbitrary probability density. Given any e > 0, there 
exists if G Gl>oIIl such that 


\\p-if\\L^ < e . 
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Proof. Let e > 0. By [30, Theorem 3.14], continuous functions on Jq = S \ {0} are 
dense in L^, so we can approximate (f in by a non-negative continuous function g 
such that ||y9 — gWii < I- Then gi = -|- | is strictly positive. By the Weierstrass 

approximation theorem, gi can be approximated by a strictly positive polynomial g 2 such 
that ll^fi — 5 ' 2 ||li < |- Collecting, \\(p — 5 ' 2 ||li < f, which also implies |1 — m( 5 f 2 )| < §• 
Note that = xn{g 2 )~^g 2 G see item (hi) of Remark 5.4. Moreover, 

\\p - - ^2||l 1 + 11^2 - m(^2)"^^2||Li ^ | + l^(^2) “ 1| < £ , 

which is the desired bound. □ 


6. Preliminaries I: densities, composition and decorrelation 

The conceptually important results of this section are: Lemma 6.2, which gives uniform 
control of the Lipschitz constants for the densities of pushforwards; Lemma 6.10, which 
says that the pushforward density Pn,[nt] is pretty close to the SRB density ps, provided 
the system has been running for a while and provided s is close to t; and the exponential 
decorrelation results of Corollary 6.7 and Lemma 6.11. 

Recall that 1 < A < T' and ||T"||oo < A* < cxo for all maps T G Ai. 


Lemma 6.1. Denoting by T any composition of a finite sequence of maps from Ai, 

T'ix) 


sup sup 

T x£S 


Vix) 


< oo 


(13) 


Moreover, there exists C > 0 such that, for all such T, if I is an arc on which T is 
diffeomorphic, 

- <c|r(4,,)| (14) 


T'{y) 


- 1 


for all x,y ^ I, where denotes the subarc of I connecting x and y. 


Proof. Suppose T = o • • • o Ti, where Tj E Ai. For j > 2, write Pj = Tj_i o • • • o Ti 
and let Pi be the identity map. Then 


n-'U ^ n-u DU ^ 

r {x) = r [x) 2^ 7U777T7Z1V^, (^) 1 


so, noting \P'Ax)/T'{x)\ < M 


i — l—k 


Vix) 


T>(x^2 


u 


k-1 

< , 

j=0 


from which (13) follows. 

By the change-of-variables formula and (13), 


I IM 
T'(.y} 


II 

V 


dm 


< 


’Tilx.y) 


T" 


(T'f 


dm<C|r(4,,)| . 


Now (14) follows from the estimate |m — 1| < el^°®“l|logM|, u > 0. 


□ 
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Lemma 6.2. There exists L* > 1 such that the following hold: 

(i) For all L > and T G 

CtFi C Fl . 

(ii) Given L > L^, there exists a constant N{L) such that 

Ck - ■ ■ CiVl C Pl, , k > N{L) , 
for any sequence of maps (T,) C M. 

(Hi) Let L > and z e S. Suppose r) G T>l with \ log'^(x) — log'^(|/)| < L\x — y\z 
for x,y G Jz- Let T he a composition of k maps from M. Let I G Jz be a subarc 
such that T maps I diffeomorphically onto J-j-z- Then the conditional probability density 
-0/ = IJj'ip satisfies 

CrifJi) G Vl , 

the (possible) jump being located at Tz. If k > N{L), then Crigfi) G "Dl. ■ 


Proof. Let L > 0 and assume i) G T>l, with z as in the definition above. Let T be a 
composition of k maps from Ai, for some k > 1. We can subdivide Jz into intervals 
Ji,..., such that Tm = T\Im '■ Im ^ Jtz IS smooth and bijective for 1 < m < j. Then 

j 

I^rrr.'tp , 

m=l 

where Ct-^i/j{x) = ^{rjfx). Moreover, using (13), for some K < oo, 

I ^ogCr^fj{x) - log Cr^fjiy)] 

< I ^og^p{T-^x) - log^p{T-^y)\ + I logr'(r"^x) - logr'(r"^|/)| 

< L\t-^x - T-^y\z + \\T''/(T'fWoolx - y\Tz 

<{X-^L + K)\x-y\T. = r . 

Hence e~'^Crmi^ix) < Urmi^iy) < Summing over m yields 

Crip G T)\-nL^K , ip ^Vl ■ 

Fix L* = (1 — > K. Then parts (i) and (ii) of the lemma hold. As for part (hi), 

C-riiJi) = '^r^('0/), for Im = I- Since 

I \ogCripiix) - logCripi{y) \ = \ logCr^ip{x) - logCr^ip{y)\ < r, 

and since Cripi is a probability density, we obtain part (iii). □ 

Lemma 6.3. There exists C > 0 such that 

\\Ck---Cil\\^<C , (15) 

\\Ck-■ ■ CiWco^c^ < C ^ (16) 

\\Ck ■ ■ ■ Ci^U'p^up ^ C , (17) 


and 


Cf- £ifc||Lip < CA-^llfcllLip + C||fc||c. . 


(18) 
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Proof. Lemma 6.2(i) implies Ck - ■ ■ -Cil G which implies (15), from which (16) follows 
immediately. Now let h be Lipschitz continuous. Let T = o • • • oTi. Let Xi^-k and yi^-k 
denote the preimages of x and y under the same (ith) branch of the inverse of T. (The 
inverse branches are defined relative to the shorter arc connecting x and y.) Recalling (14), 
we deduce 


\CtKx) - CTKy)\ < 

i 


K^i-k) 

V{xi_k) 


Kvi-k) 

T’{yi,-k) 


< 


E 


h{xi-k) - h{yi-k) h{yi_k) 

T'{xi-k) V{yi-k) 


' T\y,,-k) 

_V{xi-k) 


1 


< 


A ^Lip(h)d(a:,|/) ^ 

i 



+ Cd{x,y)^ 

i 


\h\{yi,-k) 

V{yi-k) 


< (A ^Up{h)Crlix) + CCr\h\{y)) d{x,y) 

< (A-'^Lip(h) +^ 11 / 11100 ) \\CTl\\ood{x,y) . 


Noting that ||£ 7 -l||oo is uniformly bounded by (15), we have 

Lip(£r/i)<C(A-"Lip(h) + ||h|U), 


which implies (18). Clearly (17) is a consequence of (18). Let us note that, if h is 
allowed to have a jump at . 2 , that is, if h is only assumed to be Lipschitz continuous with 
constant L > 0 say on J^, 

\Crh{x) - Crh{y)\ < C{\-^L + ||h|U)|a; - yW. • (19) 

□ 


Lemma 6.4. Suppose a probability density ip is Lipschitz continuous on with constant L 
for some 2 ; G S. Let T be a composition of k maps from M.. Then is Lipschitz 
continuous on with constant C{1 + L). Here C > 0 is a uniform constant. 


Proof. Observing that ||'0||oo < 1 + T, the result is a consequence of (19). □ 


Lemma 6.5. There exists a constant C > 0 such that for any Ti,T 2 G Ai, 

ll-^i — E Cdci{Ti,T2) . 

In particular, 

||A ~ '^slluip^-CO E C\t — . 

Let Ti,Ti G Ai, 1 <i < k with associated transfer operators Ci,Ci. Then 

k 

||Efc • ■ • El — ■ EiIIlip^-co < C dc^ (Tj, Tj) 

i=i 


( 20 ) 

( 21 ) 


( 22 ) 


holds uniformly. 

For obtaining perturbation estimates of the type in Lemma 6.5 it is paramount that 
the transfer operators act from a space of rather regular functions (here Lip) to a space 
of less regular functions (here (7°). See [13,16] for a general theory. 
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Proof of Lemma 6.5. Let x G S and let i? C § be the neighborhood of x of radius 
Set wt = /g T' dm G N. For each T E Ai the wt inverse branches of T on i? are well- 
defined. When dci{Ti,T 2 ) < there is a canonical correspondence between the inverse 
branches Tf/ and T^l, 1 < * < wti, such that Tffl{B) and Tffl^B) overlap (in particular, 
wti = wt 2 )- Let us denote yi = Tfl{x) and yi = Tffl^x). Then T 2 {yi) = Ti{yi) implies 

diyi,yi) < \~^d{T2{yi),T2{yi)) = \~^d{T2{yi),Ti{yi)) . 


Hence, the identity 

(A-£.)Mx)=f:(h!'.) MS.) 

i=\ 
w 

= E 

i=l 
w 

= E 


Vr!(ffi) r'(5i) 

Hvi) - Hvi 


2 = 1 


Ti{y^) 

Hvi) - Hvi) 


T[{y^) 


+ Kvi) 

+ Km) 


T[{yi) nim) 


T{{y,)-Tl{y,) ^n{y,)-T{{y, 


Ti{y,)Tl{yi) T'{y,)Ti{y,) 


yields, via (16) to obtain the uniform C, 

\\iC,-C2)h\K<C\\h\\updci{T,,T2) , 

provided (ic'i(Ti,T 2 ) < For the case (ic'i(Ti,T 2 ) > |, we can use (15), giving 

IKA - C 2 )h\K < 2 ||h|U sup ||£t 1 ||oo < C^||/^||oo| < C\\h\\updc^{K,T 2 ) . 

T&M 

The above bounds prove (20). By Holder continuity of 7 , (21) clearly holds. It now follows 
from the identity 

k 

Bk ■ ■ ■ — Ck ■ ■ ■ C-i = Bk - ■ ■ Cj+i{Cj — Cj)C,j-i ■■■ Cl 

j=i 

and the uniform bounds in (16) and (17) that also (22) is satisfied. □ 


Lemma 6 . 6 . There exists'd G (0,1) and, given L > 0, a constant C* > 0 such that 

\\Ck---Ci{K , k>Q 

for all £ Bl. 


Proof. Note that 


i’ > , 


Moreover, there exists Li > 0 such that 




RW 



e Bl, , 




We can now construct an inductive coupling scheme for exponential convergence: Sup¬ 
pose L > 0 is given and G Vl, i = 1,2. Denoting = Ck---Ci%lj\ we have 
Kn{l) — Lemma 6.2. Therefore, 


IIV'Ar(L) '^N{L)\\m — (1 ^*)\\R{'4’n{L)) ) II • 
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Moreover, G Vli, i = 1,2. Next, we repeat the procedure, treating as 

the initial densities, obtaining another factor of 1 — after iV(Li) steps, and so on. 

This yields 


lli/'fc -i/’fclUi < 2(1 - |e 

for any k > N{L) + jN{Li) and j > 0. Since the a priori bound 
for k < N{L), the result follows. 




Li < 2 holds 

□ 


Corollary 6.7. Given L > 0 and z E S, let g : S ^ W be a Lipschitz continuous function 
on Jz with constant L, and assume that m( 5 ') = 0. 

\\Ck---C^g\\L^<C^'^ , fc>0 . 

Here d is the same constant as in Lemma 6.6, and C only depends on L. 


Proof. Since m( 5 ') = 0, we have g > —L. Then the probability density 


is Lipschitz continuous on Jz with constant and By Remark 5.4, xjj E T>l- 

By Lemma 6.6, 

where C is determined by the value of L. We get 

\\Ck---Crg\\L^<C{l + L)6'^ 


as claimed. □ 

Denoting ft = f — iTi(Pt/), Corollary 6.7, above, implies that 

l|£f(ft/i)ltL. < c# (23) 

holds uniformly in t and k, with the constant depending on ||/||Lip only. 

Corollary 6.7 is also instrumental for the following lemma. 


Lemma 6.8. For any /? > 0, there exists such a constant C > 0 that for any Ti, T 2 E A4, 

llpi-p2h^<Cdci(T,,T2y-^ , (24) 

where pj is the SRB density for Tj. 

For any p' < p, there exists such a constant C > 0 that 

\\pt - Ps\W < C\t - s\^' . (25) 


Proof. We could appeal to the perturbation theory developed in [16] (see especially Corol¬ 
lary 1 and Remark 5 there). However, since we need control of pi — p 2 in only, we 
provide an independent argument. Note that pt = CtPt for all T E Ai and k > 1. 
Since the norm dominates the norm, the bounds in (22) and (11) together with 
Corollary 6.7 show that 

llpi - PsIIm < WC’lipi - P2)||li + IK/:? - 4)P2||li < 0(6’^ + kdc^{n,T2)) 

holds uniformly for all Ti,T 2 and all k. Choosing k = [log(Ti, T 2 )/logi?] on the right 
side yields (24), from which (25) follows. □ 
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Lemma 6.9. There exists b > 0 such that the following holds. Given t]' < rj and a 
probability density p that is Lipschitz continuous on Jz with constant L > 0 for some z G §, 

\\Pn,k - Pn,k\\L^ ^ Cn~'^' , k>b\ogn. 

The constant C > 0 is determined by rj' and L. 


Proof. Since Cn,kPn,k = Pn,k, we have 

Pn,k Pn,k ' ' ' kln,k—K-\-li,Pn,k—K Pn,k) P i,k^n,k ' ' ' k^n,k—K+l ^n,k^Pn-t^ 

for any K < k. In order to bound Pn,k — Pn,k in note that Lemma 6.4 and Corollary 6.7 
imply 

II Tin,A: ■ ■ ■ kln,k—K+l (^Pn,k—K Pn,k') || ^ ; 

where C is determined by L. Because ||pn,fc||Lip is uniformly bounded (see (11)) and 
because || • ||li < || • ||c0) (22) yields 

\\{Cn,k ■ ■ ■ jO,n,k-K+i - k)pn,k\\L^ < dc^ifT^g.T^^k) < . 

k—K+l<j<k 

Collecting, 

\\Pn,k-Pn,k\\L^<Ci^^ + K^^^n-^) 

for all n, K and k> K. Setting K = (—r^logn/log'd], we see that all terms on the right 
side above are bounded by Cn~^ , where C is determined by p' and L. □ 


Lemma 6.10. There exists b > 0 such that the following holds. Given p' < p and a 
probability density p that is Lipschitz continuous on with constant L > 0 for some .2 G §, 

||Pn,Lntj - Pill^i < (26) 

and more generally, 

\\pn,\nt\ - PsIIli < C(n"^' + \t- s\'^') , (27) 

provided t > bn~^ logn. The constant C > Q is determined by p' and L. 


Proof. Recalling (3), Lemmas 6.8 and 6.9 yield 

||Pn,[ntJ Pn,lnt\ P Pn,[ntJ Pt P Pt Ps||l1 ^ C(jl 4“ |T s| ) , 
as required. □ 


Recall that fn^k = / ° Tn,k o • • • o Tn,i; see (5). Corollary 6.7 is key also in the proof of 
the following decorrelation result; see [32] for a generalization. 


Lemma 6.11. There exists a constant P G (0,1) such that the following holds. Given a 
probability measure p with a density that is Lipschitz continuous on with constant L > 0 
for some z k > 2; Lipschitz continuous functions f^^\ ..., : S —)■ R; and numbers 

0 < ti < ■ ■ ■ < tk < 1; we have 


l/ytfd) . . . fPl . . . f(™) i /yt ... fPl 'll 


tm) 


for all m G {1, ... ,k — 1} and n > 0. The constant C > 0 is determined by k, H/Hnip 
and L. 
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Proof. Let p be the density of p. Let z e S be such that p is Lipschitz on with 
constant L. For the sake of brevity, let us denote 


fd) ... rM 


F = Cn,lntm} ■ --^npipF) 


and 


k 

G = JJ O Tn,\ntp O • • • O T„,Ln4^+iJ+l • 

j=m-\-l 


Claim. The Lipschitz constant of F — rri(F) on and the function G are uniformly 
bounded over all n and tj, 1 < j < k. 

Assuming the Claim, it holds true that 
PUnlntp ■ ■ ■ fnlnt^\) = ^ ° ° ° 

— m(r ■ G o o ■ ■ ■ o 

= m(F)m(G o o • • • o 

= ^(F)m(G o o... o r„,^„,„j+0 + 

= (G O o . . . o 

= M^MG o o... o r„,i) + , 

The hrst, fourth and sixth lines use dehnitions only, and the second line uses duality. The 
third and hfth lines follow from Corollary 6.7; the claim that G is bounded is needed for 
both, while the uniform Lipschitz bound on F — m(F) is crucial for the third line. In the 
hfth line we also used the fact that the density of Pn,intmj has a uniform bound on its 
Lipschitz constant; see Lemma 6.4. 


Proof of Claim. The boundedness of G is obvious. Write T = Tn^\ntm\ o • • • o 7k,i- Since p 
is a probability density, its Lipschitz property implies the upper bound p < 1 + L. Thus, 

m 

iipFiu<(i+i)nu“ii“- 

i=i 

Given x, p G Jz, we write Xi-n,yi-n for the corresponding preimages under T. (In other 
words, for each i there is an arc Ji-n containing Xi-n and yi-n such that T{Ji-n) = Jz-) 

The trivial bounds \p{xi_n) - p{yi-n)\ < L\x - y\z and If^lntpi^i-n) - fi%tjii^h-n)\ < 
Up{fJ^)\x - y\z yield 

m 

\{pF){xi-n) - {pF){yi-n)\ <\x- y\z (1 + 2L) JJ \\f^^^\\up ■ 

i=i 

As in the proof of (18), 

i 

i 

<C|x-yU(l + 2L)nil/“llLip. 

i=i 

which implies the Claim. The proof of Lemma 6.11 is complete. □ 


ipF)iXi,-n) _ ipF){yi_n) 

V{Xi_n) Viyji-n) 

riyi,-n) 

V{Xi_n) 


{pF){Xi^-n) - {pF){yi^_n) {pF){yi^_n) 

V{Xi-n) T'{yi-n) 
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7. Preliminaries II: processes (n, and Xn 


In this section we stndy the stochastic processes of interest, assnming the initial mea¬ 
sure /i has density p G Without mentioning it separately each time, the ob¬ 

servable / : S —)■ M is assumed to be Lipschitz continuous in the rest of the paper. For 
convenience, let us now recall the notations fnx = f°Tnx°- ■ 'O^np and fn,k = /n,fc~h(/n,fc) 
introduced in (5) and (12), respectively. 

Convention. Recall the dehnition of Xn in (8). Given the initial measure p, we will 
often consider the special centering Cn(t) = p{(n{-,t)) which yields /i(Xn( • G)) = 0 for 
all t. In this case, to emphasize that the initial measure has been chosen and appears in 
the dehnition of the process explicitly, we use the symbol instead of Xn, i-e-, we 
dehne the functions : S x [0,1] —?■ M : 

t) = n^Cn{x, t) - n^p{Cn{ ■ ,t)) . (28) 


In other words, given another centering sequence (c„)„>i, we have the relation 


Xn{,X,t) = +n^ / p{fn,lns\)ds -n^Cn{x,t) . 


In practice, it will be convenient to express the processes (n and using integral 
notation: we have 

[ fn,[ns\{x)ds (29) 


and 


?„(!,() = in / 


(30) 


from (6) and (28), respectively. From here on, we will routinely drop the x-dependence 
from the notation, writing just Cn(t) instead of (n{x,t), etc. Of course. 


rt2 


/ fn,[ns\ ^-5 . 


(31) 


Let us record a useful bound: given any rj' G (|,i/), Pn,[ns}{f) = p{fn,[ns}) and (26) 
imply 

h(/n,Lnq) - hs(/) = , s > bn~^ \ogn . (32) 


7.1. The variance dm 

Lemma 7.1. Recall that ft = f — Rt{f)- The limit variance in (10) can he expressed in 
terms of the series 

CO CO 

dKf) = Rt[ft] + 2 Ptiftft o 7i1 = Rtift] + 2 mimtipift)] . (33) 

k=l k=l 

The map [0,1] —?■ M+ : t e-)■ (/) is (uniformly) continuous. 


Proof. Notice that the two series are equal term by term. By (23), 

sup ImlfsClipsfs)]] < C# , 

0<s<l 
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SO the series converge absolutely at an exponential rate. On the other hand, a direct 
manipulation of (10) shows that 

m—1 

^Kf) = Mt) + 2 lim - k)fit{ftft o yf) , 


fc=l 


which suffices to prove (33). 
Defining the truncated sum 


K 




k=\ 

we have 

\ol{n-ol{j)\<\VK,-VKA^cd^ 

for all s G [0,1] and all it' > 0. Given e > 0, we fix iT so large that < e/2. Note 
that VK,t = - m{ptfY] +2J2k=i[^if^tiPtf)) - ^{PtfY]- Moreover, 

\mifCUptf))-m{fC':{pJ))\ 

<II/IIoo||4(pJ)-^"(pJ)IIm 

< ||/|looll(4 - ^^)(pt/)||M + WfWMpj - pj)|Ui 

< II/II00II4 - ^"llLip^CollA/llLip + ll/lloclKPt - Ps)f\\L^ 

< ll/liLp ~ '^sIlLip^-CollAllLip + \\pt - Ps\\l^) ■ 

By (11), (22) and (25), we see that \VK,t — Vk,s\ < £/2 and \cr^{f) — (Tg(/)| < e for any s 
such that |t — s| is sufficiently small. This proves continuity. □ 


7.2. The second moment p[[^n{t + h) — ^n(i)]^]- 

Lemma 7.2. Let L > 0 and p G The quantity p[[^n{t) — ^n(s)]^] is uniformly hounded 
over all t,s & [0,1] and n> 1. Moreover, 

(j^(/) ds + o{n~^) + h 0 ( 1 ) , (34) 

as n ^ 00 , whenever 0 < t < t + h < 1. The error terms are uniform in t and h.^ Finally, 

pt-\-h 


*.(/)ds = h»t(f) + o(h) , 


(35) 


as h —)■ 0. The error term is uniform in t. All of the bounds depend on p through L only. 


Proof. Given L > 0, the maximum of p is bounded by a constant determined by L. 
Because p{fn,k) = 0, 

+ h) - = n j I p{fn,[nsifn,[nri)drds , (36) 


It Jt 


as n —?> oo. By Lemma 6.11, 

Wn,Vns\fn,Vnr\)\<Cr\^-^\ . (37) 

The boundedness claim of the lemma now follows from elementary integration; 




h nh 


rh 


2h 


n I I dr ds = 2n / / dr ds = --- / - 1) ds < --- . 

Jt Jt Jo Js logliJo llog^l 


^We remark that the exponent in o(n 1) is not optimal, but a choice of convenience. 
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As to the second claim, let k G (0, |) be small enough that 2k < ri'{l — k) and set = 
shall now show that, in the limit n —?■ oo, the sole contribution to the double 
integral n jj.{fn^insifn,inr\) dr ds comes from the parallelepiped Pn = {(s,r) G 

[f, t : t + 2an <s<t + h — a„ and |r — s| < a„} about the diagonal. To that end, 

let 

Qn = {(s, r) G [t,t + : \r — s\ < ttn and either s < t + 2a„ or s > t + h — a„} 

and let = {(s,r) G + : |r —s| > a„}, so that [t, t + = P„U(5nUi?„. Because 

the area of is O(a^), 


n Kfn,ins\fn,inr\) dr ds = O(na^) = 0{n . 

^ Qn 

On the other hand, (37) yields 

n f f n{fn,[nsifn,[nr}) dr ds = = o{n~^) . 


Rn 


Thus, only the contribution of is significant: 




n 


[nsj ^77, [nrj ) dr ds 71 


ft Jt 


f*t-\-h — CLji pS-\-(ln 


f t-\-‘l(ln J S — CLn 


;‘(/n,[n.j/n,[nrj) drds + 0 {na^^) 


(38) 

Note that on P„, s — an > t + an > Rn ^ bn ^ logn for all but finitely many n. (This 
motivates the odd 2a„ in the definition of P„.) By (27), 


so 


This implies 

/*S-\-(ln 


sup ||Pn,Lnrj “ Ps\\l^ = 0{n ^ + <) = O^a^ ) , 

re(s—a„,s+a„) 


sup |/i(/n,Lnrj) “ h.(/)| = C>«') . 

rG(5—an,5+an) 


(*S-\-(ln 


(39) 


n 


pi,fn,lns\fn,lnr\') bL n I pi,fn,lns\fn,lnr\') psi^f') dr T 0{nan ) • (^0) 


We split the domain of integration [s —a^, s + a„] on the right side into two halves. Setting 
= b(i — {ns}), we have (using (39) to pass to the third line) 


'*S-\-CLn 


71 


[nsj fn, [nr\ ) dr n / p{fn,lns}fn,ln{s+r)}) dr 

Js Jo 

pO’n 

[nsj (/* ^ I [nsj (//* ^ ^n,[n(sH-r)J O • • • O -j-i) dr 

J bn 

PO'n 

= bnnfisif) +n Psiff o Tn^in{s+r)\ o • • • o T„,Lnsj+i) dr + 0(a(^' + na),+^') 

J bn 

rfjn PO,n 

= n m{fpsf) dr + n m{fCn,in{s+r)\ ■ ■ ■ Cn,ins\+iipsf)) dr + 0(na),+^') . 

t-/ 0 'J bn 

We can replace Cn,\n{s+r)\ ■ ■ ■'Cn,[nsj+i by since, recalling (22) and (11), 


iLi 


||/:n,W.+OJ • ■■Cn,\_ns\ + l{Psf) - £ ^ J) | 

< ||£„,W.+OJ ■ • ■ ||Lip^co||Ps/||Lip 

< Cnai^^ = 0{nal^^') 
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uniformly for r G [0,0^]. Hence, 




n J /i(/n,M/n,Lnrj) = ipj)) dr + Oin^al-^^') . 

A similar computation, which we leave to the reader, yields 

n f h(/n,Lnsj/n,Kj)dr = n T m(/4H-Ks+.)J. 

J S — dn —^71 

By (40), we have thus shown that 


f*S-\-(ln 


n 


= n 


= n 


dn 

poo 


[nsj fn, [nrj ) dr 

dr + 0(r“" + n^al+^') 


= ^sif) + 0{n^al-^^') . 

The second last line follows from \m{fsCg{psfs))\ < C\\Cg{psfs)\\L^ < (see (23)) and 
the last one from (33). Recalling (38) and (36), we obtain 

pt-\-h—dn 

f^[[^n{t + h) -^n(t)]^] = 


' t-\-2dn 

pt-\-h 


if) ds + 0(nal + 


cr„ 


(/) ds + o(an + na^ + hri^a^^' 




(T\ 


'(/) ds + O , 


which implies (34), by choice of k. 

Finally, the function s i-A- (J^(/) is uniformly continuous by Lemma 7.1. Hence, it has 
an increasing modulus of continuity w : [0,1] —)■ M+ such that — d^(/)| < tr(|t —s|) 

holds for all s,f G [0,1] and lim^^o'ii'(<^) = 0. Therefore, | (T^(/) ds — ha'^{f)\ < 

hw{h) = o{h) as h —)■ 0, uniformly in t. □ 


7.3. Decorrelation at the process level. Next, we introduce useful partitions of S 
having the property that x i—)■ ^n{x,t) (and x i-A- (n{x,t)) is nearly constant on each 
partition element. To that end, fix 2 ; G S. For any integer n > 1 and real number 
t G (0,1), there exists an induced partition Vz,n,t = {Iz,n,t,j}f=i of the arc = S \ {z} 
into subarcs Iz,n,t,j with the property that the restriction of Tn^int] o • • • o Tn,i to Iz,n,t,j is 
one-to-one and onto Jz„ ^ = S \ {zn,t} for all j. Here Zn,t = ^n,[ntj o ■ • • o Tn^i{z). It follows 
from the uniform expansion property that 

\Tn,lnslO---oTn,l{Iz,n,t,,)\<CX^<^^-d 

for all s <t. Since / G Lip, this yields the uniform bound 

\^n{x,s) - ^niy,s)\ <Cn~^ ( 41 ) 

for all X, 1 / G Iz,n,t,j and j, for all s < t. Given bounded and Lipschitz continuous 
functions Bi,..., B„i on R, real numbers 0 < fi < • • • < fm < f, and a probability 
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measure /i with density p > 0, integrating the previous bound with respect to d/i(|/) 
yields the existence of C — determined by ni<fc<m ll-^fcllnip — such that 


\Bi{^n{x,ti)) . . . . . Bm{^ri{tm))]\ < Cfl 2 (42) 

for all X G Iz,n,t,j and j. Here lJ^z,n,t,j denotes the conditional measure ^ ^ ■] . 


Lemma 7.3. Suppose p G y^L>oB>L. 

(i) If A e then 

p[H(C„(s))[Cn(t) - Cn(s)]] - p[A{Cn{s))] p[Cn{t) - Cn(s)] = o(l) 
as n ^ oo, whenever 0 < s < t < 1. 

(ii) If A ^ and q G {1,2}, then 

t^[A{^n{s))[^n{t) - ^n(s)]'^] - p[H(^,,(s))] p[[^n{t) “ ^n(s)]‘'] = o(l) 
as n ^ oo, whenever 0 < s < t < 1. 


Proof. Note that, since (n is uniformly bounded, it suffices to assume A G in both 

parts of the lemma. We only prove part (ii) concerning and leave the easier, but 
similar, part (i) concerning (n to the reader. 

Let z G § be a point such that log p is Lipschitz continuous on with constant L > 0, 
and consider the induced partition Vz,n,s = {Iz,n,s,j}^=i relative to this point. Since 
p[[^n(t) — ^n(s)]'^] is uniformly bounded by Lemma 7.2, a special case of (42) yields 

h[^(^n(s))[^n(t) -^n(s)]''] 
j 

= - ^n(s)]''] +0(n"^) 

j 

= '^P[^h,n.sAi^n{s))]Pz,n,s,j[[^n{t)-^n{s)Y]+0{n-^) . 

j 

Here Pz,n,s,j is the measure p conditioned on Iz,n,s,j', let us denote the conditional density 
by Pz,n,s,j- To hnish the proof, it is enough to show that 

max \pz,n,s,j[[^n{t) - Cn(s)]‘'] - p[[^n{t) - ^n(s)]'^]| = o(l) 


as n —)■ oo. Fixing p > \ arbitrarily, it follows from (31) that 


I,n{t) - in{s) = n2 


' s-\-n~P 


fn,[nr\ d'T -|- o(l) 


holds for any t and s. In other words, the lower limit of integration can be slightly 
increased, and it is sufficient to prove 


max 

j 




1 

n2 


' s+n P 


fn,[nr\ dr 




1 


' s+n P 


fn,[nr] dr 


= 0 ( 1 ). 


The increment n ^ facilitates the use of convergence of the pushforwards of Pz,n,s,j 
and p. Indeed, since /n,[nrj is uniformly bounded, it suffices to show that 


max II pp ''' I~'jip(^Pz,n,s,j p)||li oiji 2) . 
j 
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Without loss of generality, we may assume that L > L*. By virtue of Lemma 6.2, both of 
the densities Pz,n,s,j ^n,ins} ■ ■ ■ ^n,iPz,n,s,j uud Pn,ins\ ‘'' ^n,iP ure theu in 

Lemma 6.6 now yields 

max IIP)J • ■ ' ^n,lns\+l(^Pz,n,s,j Pn,[nsj)||Ll ^ C'& , 

where C depends on L. The proof is complete. □ 


8. Proof of Theorem 3.1 

Recall the integral expression of C,n in (29). Given an initial probability measure p on S, 
each (n is a random element of ^*^([0,1],]^) with distribution P(). The corresponding 
expectation will be denoted by E^. 

In probability theory, the notion of tightness plays a central role in obtaining limit 
laws: Since G°([0,1],M) is a complete and separable metric space, Prohorov’s theorem 
states that a collection of probability measures on it is tight if and only if the collection 
is relatively sequentially compact in the topology of weak convergence [4]. Hence, a 
tight sequence of measures is guaranteed to have limit points in the topology of weak 
convergence. 

Lemma 8.1. Let the measure p be arbitrary. The sequence of measures (P(()„>i is tight. 


Proof. Note already that, for ti,t 2 G [0,1], 

pt2 

Cnif'2) / fn,\ns\ ds . 

Jti 

Accordingly, |Cn(t 2 ) - Cn(H)| < (^2 - H)||/||oo- In other words, the sequence (Cn)n>i of 
functions is uniformly Lipschitz and bounded. This suffices for tightness on the classical 
Wiener space ([0,1], M). □ 

Next, we are going to show that the sequence actually has a unique limit, which we are 
going to identify. The following Dynkin formula will turn out useful in this regard. In order 
to formulate it properly, let us introduce the evaluation functionals 1], M) —)■ M, 

t E [0,1], dehned by 

7ri(a;) = a;(f) . 

Lemma 8.2. Let p G Suppose P is the weak limit of a subsequence 

and denote by E the expectation with respect to P. For any A G 

^E[A o TTi] = E[A' o TTt] pt{f) . (43) 

Proof. Let A G C'°°(M). Using the uniform Lipschitz continuity and boundedness of (Cn)n>i, 
we get 

^{Cn{t + h)) — A{(n{t)) = A' {(n{t))[(nit + h) — Cn(^)] + 0{h^) , 

where the error term is uniform. Next, we integrate the above expansion with respect 
to p and take n ^ oo along the subsequence {nk)k>i- Lemma 7.3 guarantees that 

p[A'{Cn{t)) [Cn{t +h)- C(t)]] - p[A’{C^{t))] p[Cn{t + h) - Cn(i)] = o(l) 
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as n — ?■ cxD. For the weak limit P, 

hm n[A{Cn{t + h)) - A(Cn(t))] = E[A o - Aoiit] 

and 


k^oo 


lim n[A\Cn,{t))] = lim E^ [A o m] = E[A o m]. 


k^oo 


/c—)-oo 


Recalling (32), 

By Lemma 6.8, 




\im IJ,[Cn{t + h) - Cn{t)] = fis{f)ds . 




fis(,f)ds = fit{f)h + o{h) , 


which finishes the proof. 


□ 


8.1. Proof of Proposition 4.1. We now show that the weak limit P of the subse¬ 
quence (P((j.)fc>i is the point mass at ^ G C°([0,1],M) defined in (7). 

With the aid of (43), we begin by computing 

Le[K, - c(i)t] = - 2C(i)4EW - 2 EW 4(t) + k|c(t)|2 

= 2E[77j i%{f) - 2C(t)i%{f) - 2Elir,] A.(/) + 2C(()A.(/) = 0 . 

Since ttq = 0 almost surely with respect to P and C(0) = 0, we have E[|7rt — C(i^)P] = 0 
for all t G [0,1]. By Tonelli’s theorem. 


E 


\ 7 it - C(^)Pdt 


Uo 


= / E[|7ri-C(t)r]dt = 0 


By the continuity of the paths, this proves the claim. 

In particular, the limit P is independent of the initial density p G Ul>o 7 ^l, and of the 
weakly converging subsequence (P((^)fc>i. Thus, we have shown that, for any initial mea¬ 
sure p with such a density, the sequence (Pn)n>l itself converges weakly to P, completing 
the proof of Proposition 4.1. □ 

8.2. Proof of Theorem 3.1. Next, suppose u is an arbitrary absolutely continuous 
initial probability measure with density tfj. Let F : C'°([0,1],M) ^ R be an arbitrary 
bounded continuous function, and denote M = supj^g( 3 .o([o,i],R) |-E(a;)|. By Lemma 5.5, 
given any e > 0, there exists a measure p with density p G Ul>oT’l such that 

W-PlU.<^. 

By the established weak convergence of (Pn)n>l, there exists an integer iV > 0 such that 


lESIFl-ElF]!^ 


n > N 


|e:;[f]-ec[f]| = 


F{Cn{x, ■))du{x)- / F{Cr,{x, ')) dp{x) 


< j |E(Cn(a;, •)) (V’(a;)-p(a;))|dm(a:) 

< pWl^ < I, 


Then 
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SO that 

\E^^[F]-E[F]\<e , n>N. 

By the portmanteau theorem, this suffices to show that (P|:^)„>i converges weakly to P. 
The proof of Theorem 3.1 is now complete. □ 


Remark 8.3. The last part of the proof implies that the results of this section hold for 
arbitrary absolutely continuous initial measures. 


9. Proofs of Lemma 3.5 and Theorem 3.6 

9.1. Proof of Lemma 3.5. If r; G (0,1) is arbitrary and i/ is a measure having a Lipschitz 
continuous density fj, then Corollary 6.7 yields 

1 1 if* 

,t)) ,t))\<n^ / W{fn,[ns}) - mifn,[ns})\ds 


<n^ Wn,[ns\{f) -mn,[ns\if)\ds <n2 


^n, [nsj ■ ■ ■ hl-npidP dm 


ds 


<Cn^ / d^^ds<Cn—2 


uniformly in t. Hence the admissibility condition (9) is satisfied. This proves item (i). 
Now assume 17 > ^ and fix 77 ' G (|,i 7 ). Recalling (32), 

\n^C{t),t))\ <n^ [ \fis{f) - m{fn,insi)\ds < + n~hogn) . (44) 

Jo 

Again the admissibility condition (9) is satisfied. This proves item (ii). 

Finally, let e > 0 be arbitrary, and denote the density of u by fj. By Lemma 5.5, there 
exists a measure jj, with density ip ^T>l such that ||(/? — Tf H^i < 


— 1 £ 
00 2 * 


Then 


sup |l/(Cn(-,f)) -C(f)l < / k(/n,Lnsj) -/i.(/)|ds < - + / \p{fn,[ns\)-Ts{f)\ds. 

7e[o,i] Jo ^ Jo 

By (32), the last term is bounded by | for all large enough n. This proves item (iii). 

This finishes the proof of Lemma 3.5. □ 


9.2. Proof of Proposition 4.2. Throughout this section we will assume that the initial 
measure /i is given and use the centering c„(f) = p{Cnit)) for the process Xn- Recall the 
convention from the beginning of Section 7 that in this case we write instead of Xn, 
in order to stress the explicit role of the initial measure in the centering. Thus has 
the definition in (28) and the integral expression in (30). Each is a random element 
of C°([0,1], M) with distribution The expectation corresponding to will be denoted 
by E((. 

The rest of the subsection constitutes the proof of Proposition 4.2. As in the previous 
section, the first step is to prove tightness. 

Lemma 9.1. Let p be a measure with a density that is Lipschitz continuous on for 
some z G §. The sequence of measures (P(()„>i is tight. 
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Proof. Note that ttq = 0 almost surely with respect to for all n > 1. By Kolmogorov’s 
criterion, it is sufficient to hud a constant K > 0 such that 

p[\^nit2) - = E^[|7rf2 - < KTlta-tip (45) 

holds for all ti,t 2 ^ [0,1] and all n > 1. By symmetry, the left side can be expressed as 

pt2 rs pr pu 

I 4! Tl I I I I • 

J ti J t\J t\J t\ 

Observe that iJi{fn,[nt\) = 0 for all t. Hence, for n < u < r < s, 

by Lemma 6.11. Assuming p < ^ 2 , let us dehne h = n dxdy. Then 

/ < 4! 


by the preceding bound. Since 
rv 

/ ^-t^da: = - 
Jti n log 1 ? 2 

we get 

L < 


1 'i9~iy 


n log 'd 2 


(^2 — ti) 

logit^z 


which gives the desired estimate in (45). Hence, (P^)n>i is tight. 


□ 


As in the previous section, the next step is to study the weak limit points of the 
sequence (P^)„>i, whose existence is guaranteed by tightness. Again, a Dynkin formula 
will be instrumental, albeit additional work will be required in the present setting. We 
write A G C'^(M) if A G C'°°(M) and A vanishes outside a compact set. 


Lemma 9.2. Let p G Suppose P is the weak limit of a subsequence (P((j,)fc>i, 

and denote by E the expectation with respect to P. For any A G 


E|/l o IT,] = E|/l ° "‘ol + 1 / E|/l" o irj dl(f) ds . 


(46) 


Proof. By Taylor’s theorem, there exists such a n G M that 
+ h)) = A{^nif)) + {^n(t))[^n(t + h) — ^n(^)] 

+ i4l"(f„(())|f„(( + h) - + ft) - Ut)f . 

Note that 

p[\^n{t + h) - ^ri{t)\^] <Ch^'i=o{h) 

by Jensen’s inequality together with (45). Since A'" is bounded, we thus have 

+ h))] = p[A{^nii))] + p[-^' + h) — fri 


+ i^n{t))[fn{t + h) — 'Cn(^)]^] + o{h) 

where the error term is uniform in n and t. 

Lemma 7.3 guarantees that, for q G {1,2}, 

p[A^'^\fn{t))[fn{t + h)- Cn(i)]''] - p[A^'^'> {fn{t))] p[[fn{t + h) - ^n(t)]®] ^ 0 

as n ^ oo. By (31), 


p[fnit + h)- ^„(f)] = 0 . 
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Next, note that 


lim fi[A"{^ndt))] = E[A" o TTt] . 

k^oo 


By Lemma 7.2, 

lim + h)- ^n{t)f] = ha^{f) + o{h) . 


Recall that the function t i-A- d'^{f) is continuous. Since also t i-A- E[A" o vrt] is continuous, 
we thus arrive at (46). □ 


Define the differential operator 



dx'^ 


Note that appears on the right side of (46). Lemma 9.2 thus leads us to conjecture 
that the limit process x is a diffusion with as its generator. That is, x should solve 
the stochastic differential equation 


dx{t) = Mf)dWt , (47) 

where Wt is a standard Brownian motion. Indeed, ltd calculus for x defined by (47) 
yields a Dynkin formula which is of precisely the same form as (46), with the law of x in 
place of P. We proceed to prove rigorously that the limit process is indeed characterized 
by (47). 

Let us briefly discuss the solutions to (47), which are here always required to start 
at 0. Since the coefficient dtif) is bounded in t and independent of x, given a Brownian 
motion, there exists a strong solution (adapted to the filtration generated by the Brownian 
motion) which has continuous paths and is strongly unique (i.e., its modifications are 
indistinguishable). Moreover, weak solutions to (47) are unique in law; from here on 
we denote the associated law by Q. These facts imply that the martingale problem 
corresponding to the generator and the starting point 0 is well posed: 


Lemma 9.3. The measure Q is the unique measure with the properties that Qijio = 0) = 1 
and that, for all A G (^“(M), the process 


Mt = A o 71^ — A o ttq — / T£sA o tTs ds , t G [0,1] 


(48) 


is a martingale with respect to Q and the filtration {^t)o<t<i, where is the sigma-algebra 
on C°([0,1], M) generated by {tt^ : 0 < s < t}. 


We refer to [29] for the proofs of the above statements. Also [12] is a helpful text on 
stochastic analysis. 

The next result states that also the measure P solves the above martingale problem. 
Hence, it follows from the lemma above that ¥ = Q. 


Proposition 9.4. Suppose the density of the initial measure p, is in Ul>oT>l and that P 
is the weak limit of a subsequence (Pnfc)A:>i- Then, given any A G (^^[^(M), the pro¬ 
cess {Mt)t£[o,i] defined in (48) is a martingale with respect to P and the filtration {dt)o<t<i- 
In particular, 


P = Q . 
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Note the limit P is then independent of the initial density p G and of the weakly 

converging subsequence (P^^)fc>i. Thus, for any initial measure p with such a density, the 
sequence (P^)„>i itself converges weakly to P. Accordingly, we have identified the limit 
of to be the stochastic process y appearing in (47). In particular, once this proposition 
is proven, we will have proven Proposition 4.2. 


Proof of Proposition O.f. Let L > 0 and 2 ; G S be such that the logarithm of the initial 
density p is Lipschitz continuous on with constant L. Since A and A" are bounded, we 
have E[|M 4 |] < 00 for all t G [0,1]. It remains to prove that 

E[Mt - M, 11^,] = 0 

whenever 0 < r < f < 1. The martingale condition above is equivalent to the one that 

E[Bi O TTti • • ■ 5m O TTt^ {Mt - Mr)] = 0 (49) 

whenever m > 1; and Bi,, Bm : M —)■ M are bounded, Lipschitz continuous functions; 
and We now fix such numbers and functions for good. 

Let us fix now q G (0, 4), and write Kn = [n'^(t — r)J and 6n = (f — r)/K^. Since 


Kn-l 

Mt - Mr = ^ {Mr+{k+l)5r, - Mr+kSr) , 
k=0 

equation (49) will follow once we establish that 

Kn sup p[Bi{^n{tl)) ■ ■■Bmifnitm)) (Mu+S^ - M„)(^„)] = o(l) , 
r<U<t—Sn 


as n ^ cx). 

We proceed as in the proof of Lemma 7.3, resorting to the induced partition Vz,n,u = 
{h,n,u,j}^=i of fhe arc = S \ {z}. Let Xz,n,u,j denote the midpoint of Iz,n,u,j and 
Cz,n,u,j = ^n{xz,n,u,j, u) the value of ^n{' ,u) dX the midpoint. It will then be convenient to 
define the function ^n,u : S x [0,1] —?■ M by setting 

t) t) fn(^X, u) p Cz^n,u,j 

for all X G Iz,n,u,j and all j. (For brevity, we suppress the ^-dependence of ^n,u-) We 
think of fnu cis a modification of ^n according to the value of the latter process at time u. 
By (41), 

sup|^„(x,f) =niax sup |Cn(a;,n) - fn{Xz,n,u,j,u)\ < Cn~^ , 

xeS 1 XGlz^n,u,j 

uniformly in u. Since A and A" are Lipschitz continuous, the functional M^+Sn ~ '■ 

C°([0,1], R) —)■ R satisfies 


sup \{Mn+S„ - Mn){^n{x, ' )) “ {Mn+S„ - M„)(4,«(x, ■))\<Cn 2 . 
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Because Bi,, Bm are furthermore bounded, Lipschitz continous functions, (42) yields 




0{n 




(AW. - + 0(n-5) 

= - M„) ( 4 , J +0(72"^) 

j 

= - M„) (4,«) + 0 (n"^). 

j 

Here the error term is again uniform in u. Since = o(n^), it thus suffices to show that 


Kn sup max 

r<U<t—Sn ^ 






= 0 ( 1 ). 


Here 


ru+5n 


{Mu+Sr, - Mu){^n,u) = + ^n)) “ A{^n,u{'<^)) “ / AfsA(^n,u{^)) . 


For X e h,n,u,j, we Taylor expand + 5„)) at = (^z,n,u,j- By Taylor’s 

theorem, there exists Kz^n,uj{x) G M such that 


{Mu+ 5„ - Mu){in,u) = A’A^n,u,j) [AA + S^) - 

T 


fU+Sn 


A''A,n,u,j) [A{U + (5n) - A{u)] - / A^sAi^nA^)) 


1 


+ qA'" [Kz^n,u,j) [AA 4“ A) ~ 'Cn(rt)]^ 


We bound the term on each line on the right side separately. The second line requires 
demonstrating cancellations in the difference and is saved for last. 

It is instrumental that (assuming [nfij > N{L)) the density Pz,n,u,j of Pz,n,u,j satisfies 


P. 


z,n^u,j 


= c 


n, \_nu\ 




n,lPz,n,u,j 


ev. 


by part (iii) of Lemma 6.2. We denote the measure corresponding to pz,n,u,j by Pz,n,u,j 
Moreover, it will be convenient to define 


I fn,[ns],lnu]+l /^n,[nuj (/n,[nsj,[nuj+l) Cl'S 


pu+h 


where fn,k,i = / o Tn,k o ■ ■ ■ oTn,i for k > I and fn,k,k+i = /• Then 

A A + h) - A A) = in\h) o Tn,\nu\ O ' ' ' O . 

It is helpful to think of as A shifted along the curve 7 . Below, we will need to change 
the centering of Ai \ so we already define 

pu+h 

= A fn,lns\,lnu\+l “ l^(/n,[nsj,[n^J+1) = AAKA - 

J u 


for an arbitrary measure u with density '0 G T>l^. By Corollary 6.7 applied to the 
difference g = ip — Pn,\nu\-, we have the uniform bound 

Hfn,VnslVnu\+l) " /^n,[n.J (/n,[n.J,[n^J+l) I = 0(7?^-^-^’^“^) = 0(79"(*-“)) . 
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Hence, an integration yields 

/ u+h 

^{,fn,[ns],[nu\-\-l') [nuj [nsj,[nuj+ 1 ) ^.5 0 (tI 


uniformly in u, h and z/, so that 



(60) 

and, by v[^n^''\h)\ ^ 0, 


’'[&W]=0(n-i) 

(61) 


uniformly in ti, h and u. Note that the error terms above are independent of x. 


The first term. Since 

where A' is bounded, we only need a bound on the second factor on the right. Here 

Recalling the earlier remark on flz,n,ujj (51) yields 

= o{K-') . 

In particular, 

Kn sup max (^Cz^n,uJ^ [^n(^ "b * 

r<u<t—Sn ^ 


The third term. Using the boundedness of A'" together with Jensen’s inequality, 

3 

\h‘Z,n,u,j \_A ^nz,n,u,j) ['b)T-(^ J~ 'bn(ll)] ] | ^ Cflz^n,u,j J~ ^n) ] 

Here 

Tz,n,u,j [[Cn(M + 5n) - Cn(M)]^] = Tz,n,u,j • 

Let us consider an arbitrary measure u with density 'ijj G instead of fiz,n,u,j- First of 
all, we have the uniform bound 

analogously to (45). Together with (50), Jensen’s inequality then shows that 

^[^^n\hy] =0{h^ + 0n-^ + hn-^ + n'^) (52) 

uniformly. In particular, 

= 0(6l + 6ln-i + S„n-' + u-^)i = o(K-^) , 

SO that 

Kfi sup max [d (^2:,n,w,j) “b ^n) 'Cn(^)] ] I ^(1) * 

r<u<t—Sn ^ 


The second term. Note that 

;i 

Tz,n,u,j 

where 


-A''iCz,n,u,j) [^n(M + <5n) “ ^n{u)y 


2^ ^^z,n,u,j') Tz,n,u,j\\,y,n(d^ A Sn) 'bn(ll)] ] ) 


Tz,n,uJ [[^n(M + dn) - Cn(M)]^] = Tz,n,u,j 
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On the other hand, 

ru-\-5n 


_J u 

f'U+Sn 




1 




MIWi^nAs)) 


ds 


1 


ru+ 5 n 


CU+Sn 




2 A 

pu+Sn 

2 ./, 


a: 


cr. 


'if) l^z,n,u,j -u)o Tn^lnu] O • • • O + C^,n,u,j)] ds 

'if) flz,n,u,j - m) + C^,n,u,j)] ds . 


By Taylor’s theorem, there exists Kz^n,u,j,sA) ^ ^5 such that 

f‘Z,n,u,j (A ^z,n,u,j)~\ ~ ^ (Cz,n,u,j) T f‘Z,n,u,j {f^z,n,u,j,s) Cn ^)] 

Using the boundedness of A'" together with Jensen’s inequality, (52) yields 


lA,n,uJ [A'"A,,n,uJ,s) -«)]!< C/lz,n,uJ - u)^] " = 0(j|) . 


Accordingly, 


l^z,n,u,j 


PU+5n 


■^sMLA^)) ds 


2^ {^z,n,u,j) 

= Ia"(c 


rU+Sn 


Asif) ds + 0{6A 


z^n^u^j, 


d«(/)^n + 0(J„) . 


In the second line we used (35). We remark that the error term o(Sn) = o(K^ ^) is uniform 
in u and j, and that A" is bounded. In order to prove that 


Kn sup max 

r<U<t — Sn ^ 


f^z^n,uj 


f*U+Sn 


-A''iCz,n,u,j) [fn{u + (5„) - A{u)] “ / A^sMA,u{^)) ds 


tends to zero as n —?■ oo, it thus only remains to show 

Kn sup max \fiz,n,u,j[^A{^n)‘^] “ ^Hf) 5n| = o(l) . 

r<U<t—Sn ^ 

Let us again consider an arbitrary measure u with density ip G Pl, instead of fiz,n,u,j- 
Recalling (50) and v\iA^\h)'\ = 0, we get the uniform estimate 

= qe‘"’(«"] +o(n-‘). 

Analogously to Lemma 7.2, 

Kn\AA/^\Sn)^]-dl{f)6n\=Kn\0{6n)+0{nA)\=0{l) , 
uniformly in u and u. The last bounds combined yield the desired bound. 

This finishes the proof of Proposition 9.4. □ 


The proof of Proposition 4.2 is now complete. 


□ 
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9.3. Finishing the proof of Theorem 3.6. It remains to upgrade Proposition 4.2 to 
the full version of Theorem 3.6. The upgrade entails relaxing the regularity assumption 
on the initial measure as well as the assumption that the centering sequence be dehned 
in terms of the initial measure. To facilitate these changes, let us introduce the explicit 
notation 

Xnit) = nHn{t) -U^Z/(Cn(t)) , 

for any measure u, and 

Xn{t) = n^Cn{t) -nhn{t) , 

for any centering sequence {cn)n>i- Note that all functions above, including c„, depend 
on X, but following our earlier convention we suppress it from the notation. Given an 
initial measure /x, we denote the laws of Xn Xn" by and respectively. The 

respective expectations are denoted and 

Assume /x is an arbitrary absolutely continuous measure with density p and {cn)n>i 
is an arbitrary centering sequence admissible with respect to /x; see Dehnition 3.3. Our 
proof of Theorem 3.6 amounts to showing that there exists a measure u with density G 
Ul>oT>l with the following properties: (i) approximates P, (ii) approximates 
(hi) P((’'" approximates PJ)’"", and (iv) approximates arbitrarily well for all large 
enough n > 1. This will be accomplished using the portmanteau theorem, as follows. 

Let F : (^^([O,!],]^) —)■ M be an arbitrary bounded Lipschitz continuous function 
and e > 0 an arbitrary number. Denote M = sup^gc' 0 ([o,i],K) and £ = Lip(F). 

Step (i). Observe that Proposition 4.2 applies directly to P((’^: for any ijj G there 

exists an integer A^i > 0 such that 

|ElF|-ErlX’]l<|. n>N,. 


Step (ii). Since ^|J G Ul>oD2,, the centering sequence p{((n{t)) is admissible with respect 
to z/; see Lemma 3.5. By Remark 3.4, there exists an integer A "2 > 0 such that \n'^v{C,n{t)) — 
n^xn{(n{t))\ < ^ for all n> N 2 . Thus, 


iEr[F]-Er[i^]i = 


F{x:{x,-))-F{x::{x,-))du{x) 


< £ 


U 


< - , n > N2 . 
- 4 ’ - 2 


Step (Hi). By Lemma 5.5, we may assume that ^|J G Ul>o'F>l satisfies 


IIV'-pIIli < 


4M 


Then 


iEr[F]-Er[i^]i = 


FiXnix, ■))du{x)- / Fix'Oix, ■))dp{x) 


< / •)) (V'-P)( 2 :)|dm(x) 


< pWli < - , n > 1 . 

Step (iv). Recall that the centering sequence is assumed admissible with respect to /x. 
Let us denote En = {x E S : sup^g^oq] \'n^Cn{x,t) — n2m(Cn(t))| > There exists an 
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integer > 0 such that ^{En) < 


for all n> Ns. Splitting J = Je^ + 

= I FiXnix, ■))-F{Xnix^ ■))Mx) 

< 2Mfi{En) + £-^{1 - /i(-E„)) 


we have 


Collecting the bounds, we have shown that 

\E[E] - [E]\<e , n> max{Nu iVs, N^) . 

By the portmanteau theorem, this suffices to show that converges weakly to P. 

The proof of Theorem 3.6 is now complete. □ 

Remark 9.5. The last part of the proof implies that the results of this section hold for 
arbitrary absolutely continuous initial measures and admissible centering sequences. 


10. Proofs of the generalizations 

As mentioned at the beginning, little in the proofs of Theorems 3.1 and 3.6 changes 
when one passes to their generalizations. Theorems 3.9 and 3.10. In order to keep the 
presentation as lucid as possible, we have elected to save the generalizations for last. Here 
we expect the reader to be well familiar with all the preceding sections. Indeed, we will 
only point the reader to the straightforward adjustments required there to complete the 
proofs. 

10.1. Proof of Theorem 3.9. The first changes occur in Section 7. In Lemma 7.1, 
is now the d x d matrix defined in Theorem 3.9. Working with vector components, it 
amounts to a minor modification of the proof of Lemma 7.1 to show that 

OO 

(/) = Ttift ^ft] + Yl Tt[ft ® {ft o It) + {ft o It) ® ft] 

= Tt[ft «) ft] + '^m[ft (8) 4(pt/t) + {E'l{ptft)) <8) ft] , 

k=l 

and that the dependence on t is continuous. Similarly, we see that Lemma 7.2 remains 
true, except that we have the d x d matrix p[[fn{t) — 'Cn(s)] ( 8 ) ['Cn(^) — 'Cn(s)]] in place of 
the scalar p[[^n{t) ~'Cn(s)]^]. In (42), the functions R are now bounded and Lipschitz 
continuous from to M, but the bound remains true. Lemma 7.3 is modified as follows: 
we need A G in part (i) and A G in part (ii). Then (i) and (ii) 

with q = 1 continue to hold componentwise. In the case q = 2, the expression [^n{t) — 
is replaced by [^n{t) — Cn(s)] <8) [Cn(^) — 'Cn('S)]. The proof remains identical. 

Let us proceed to Section 8 . In Lemma 8.2, we get the Dynkin formula 

jE[Ao7rt] = E[VAo7it]-ftt{f) 

in place of (43), for any A G C'°°(M'^,M). Indeed, the modification of Lemma 7.3 above 
implies 

fi[VA{Cn{t)) ■ [Cn{t + h) - Cn(t)]] “ /x[VA(C„(f))] • /i[C„(f + h) - Cn(t)] = o(l) 
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as n —)■ oo; otherwise the proof is similar. The proof of Proposition 4.1 remains the same 
up to passing to vector notation. 

Finally, let us point out the changes in Section 9. In the tightness proof of Lemma 9.1, 
we have the bound in (45) separately for each vector component ^n\ 1 < i < d. 
The vector-valued case follows by an application of the Cauchy-Schwarz inequality. In 
Lemma 9.2, we get, for any A G M), the Dynkin formula 

E[A O TTt] = E[A o TTo] ^ V [ E[dijA O TTs] ds , (54) 

in place of (46). This identity is obtained as before, by Taylor expanding fi[A{^n{t))] and 
using the modifications of Lemmas 7.3 and 7.2 above. Let us define dtif) as the square 
root of the d x d covariance matrix (/). Then the stochastic differential equation (47), 
where Wt is an M'^-valued standard Brownian motion, has the partial differential operator 




5 E ■»?(/)« 


_k_ 

dxi dxj 


as its generator, so that the Dynkin formula for y has the same form as (54). Again, the 
martingale problem corresponding to the generator and the starting point 0 is well 
posed; see Lemma 9.3 with the change A G C'“(M'^,M). We refer to [29] for the proofs 
of these statements. Given the above changes, the proof of the martingale property in 
Proposition 9.4 generalizes to the vector-valued case in a straightforward manner, which 
shows that the limit law P of is the one of the process y appearing in (47). To complete 
the proof, note that Section 9.3 continues to apply, mutatis mutandis, switching from the 
space G°([0,1], M) of real-valued functions to G°([0,1], M'^). □ 


10.2. Proof of Theorem 3.10. Let 0 = Tq < Ti < • • • < = 1 be the endpoints 

of the partition elements A, 1 < i < m. 

The jumps in the curve 7 cause certain estimates in Section 6 to hold only piecewise, on 
each partition element. The first change occurs in Lemma 6.5: (21) continues to hold if s 
and t are in the same partition element. Likewise, (25) in Lemma 6.8 continues to hold 
in the same piecewise sense. Another change occurs in Lemma 6.9, where the condition 
on k is replaced by the piecewise analogue nfi -|- &logn < k < nfi+i for some i. In 
Lemma 6.10 the condition on t is replaced by A -|- hn~^ logn < t < A+i and s E li for 
some i. Otherwise Section 6 remains intact. To recapitulate, the essential change is that 
the difference Pn,[nti — Pt does not remain small when t passes one of the finitely many 
singularities fp to regain smallness, it is necessary to wait another 5logn iterates exactly 
as was the case at f = 0 before. Moreover, pt — Ps is only small if t and s are in the same 
partition element and close to each other. However, it is important to point out that the 
singularities do not affect the regularity bounds of the densities Pn,int\ in any way. 

In Section 7 there are similar changes: (32) holds if fi + bn~^ logn < s < A+i for some i. 
We also note that the dependence of (/) (and of Pt{f)) on t is piecewise-continuous, 
which is inconsequential; see the proof of Lemma 7.1 (and of Lemma 6.5). Lemma 7.2 
remains intact: in order to avoid singularities, one excludes in the domain of the s-integral 
on the right side of (38) a neighborhood of radius 2a„ centered at each singularity fj, which 
only results in another error of the same order na^ as before. These are the only parts in 
Section 7 that require attention. 

The above changes affect Section 8 in no way, so we are left with Section 9. There the 
proof of Lemma 3.5 stands; in the integral in (44) one has to take into account the above 
change in the condition of (32) and thus remove an interval of length bn~^\ogn at each 
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singularity fj, but this does not affect the error term. The above changes do not affect 
the rest of Section 9. 

The generalization to vector-valued observables is now achieved exactly as in the proof 
of Theorem 3.9. □ 
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